Quantum Machines 



Mark Hillery 
Department of Physics 
Hunter College of CUNY 
695 Park Avenue 
New York, NY 10061, USA 

Vladimir Buzek 
Research Center for Quantum Information 
Slovak Academy of Sciences 

Dubravska cesta 9 
845 11 Bratislava, Solvakia 

March 25, 2009 

Abstract 

We discuss quantum information processing machines. We start with 
single purpose machines that either redistribute quantum information or 
identify quantum states. We then move on to machines that can perform 
a number of functions, with the function they perform being determined 
by a program, which is itself a quantum state. Examples of both de- 
terministic and probabilistic programmable machines are given, and we 
conclude with a discussion of the utility of quantum programs. 

1 Introduction 

Quantum information is information stored in a quantum mechanical system 
[1 . The systems themselves arc cither collections of qubits, two-dimensional 
systems, or qudits, .D-dimcnsional systems. For example, a qubit exists in a two 
dimensional space spanned by two orthonormal states, which are denoted by |0) 
and |1). While a classical bit can be either or 1, a qubit can be in any superpo- 
sition of |0) and |1), i.e. = a\0) + (3\1), where a and (3 are complex numbers 
satisfying \a\ 2 + \(3\ 2 = 1. This fact leads to large differences in the properties 
of classical and quantum information. Once one has information is in the form 
of qubits of qudits, one would like to do something with it. Ultimately, in order 
to extract the information stored in the system, it will be necessary to measure 
the it, but before doing so, it is usually useful to perform some operations on 
the system. For example, one might want to (approximately) copy it, compare 



it with known quantum states or with information stored in other quantum sys- 
tems, or perform an operation on it. The devices that accomplish these tasks 
are quantum machines. They can either be single-purpose or programmable and 
able to perform many tasks. 

In the case of programmable systems, the program can be either quantum 
or classical. Examples of a classical programs are a sequence of laser pulses to 
put a molecule into a particular quantum state or a sequence of radio frequency 
pulses to control the dynamics of a system of nuclear spins [2]. Here, we want to 
consider programs that are themselves states of quantum systems, i.e. we want 
to perform quantum programming. Quantum programs have some properties 
that classical ones do not. For example, they can exist in superposition states, 
which means that a quantum processor can perform several programs simulta- 
neously. In addition, quantum programs are necessary when the information on 
which the program is based is, in fact, quantum information. We shall provide 
examples of each of this situations. 

As we shall see, in a number of cases, it is not possible to perform the 
desired task perfectly. For example, the no-cloning theorem prohibits the perfect 
copying of quantum information [3]- We are then faced with deciding how to 
perform the task as best as we can. This usually amounts to adopting one of 
two possible strategies. The first is to produce an output state that is as close 
as possible to the ideal output, that is we approximately perform the desired 
task. The second is to perform the task with some probability in such a way 
that we know whether the proper task has been performed or not. In that case, 
the task has been performed probabilistically. 

We shall begin by discussing single-purpose machines, in particular don- 
ers and machines that perform the discrimination of quantum states. Next, 
we shall move on to a general discussion of programmable machines and out- 
line the deterministic and probabilistic approaches to them. We will present a 
no-go theorem that shows that a deterministic programmable machine cannot 
be universal. We then consider the problem of implementing a one-parameter 
unitary group on a programmable machine and discuss the approximate and 
probabilistic strategies. We also show how, in the probabilistic case, to increase 
the probability of success by increasing the size of the program space. Finally, 
we make a case for why quantum programs are useful. 

2 Singe-purpose machines 
2.1 Cloners 

As was mentioned in the Introduction, the no-cloning theorem prohibits the 
perfect copying of quantum information. The theorem states that if we have 
a quantum system in the state we cannot build a machine whose output 
will be l^)!?/'), for all \tp). The proof relies simply on the linearity of quan- 
tum mechanics [3 . If the machine clones each vector of a basis for the input 
space, then its action is completely determined, and it is incompatible with the 
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transformation — > 

An approximate doner for qubits can be constructed from four Controllcd- 
NOT gates and three qubits (see Figure 1) |H|5j|6]. A Controlled-NOT gate 
is a two-qubit gate. The first qubit is the control qubit, and the second is the 
target qubit. If the state of the control qubit is |0), then nothing happens to 
the states of either the control or the target qubits. However, if the state of the 
control qubit is |1), then the state of the control qubit is again unchanged, but 
the state of the target qubit is flipped, i.e. if it was |0) it becomes |1), and vice 
versa. Returning now to the doner, we have that the first qubit, in the state 
\ip) = a\0) + /3|1), where {|0), |1)} is the orthonormal qubit basis, is the one 
to be copied, and second qubit will become the copy. In order to see how this 
works, define the two two-qubit states 

|Soo> = ^(|0>|0> + |1>|1» 

|3 te > = ^|0>(|0> + |1». (1) 

We now note that if qubit 1 is in the state \ip)i and qubits 2 and 3 are in one 
of the two states above, then the cloning circuit will implement the following 
transformations 

|V'>l| 3 00>23 |-0>l|S OO >23 

|^>l|30x>23 -> |V>} 2 |S o)l3- (2) 

Examining these equations, we see that in the first the quantum information 
from the first qubit appears in output 1, and in the second it appears in output 

2. This suggests that if instead of sending either |S o) or |S a;) into inputs 2 and 

3, we send in a linear combination of them, some of the quantum information 
from qubit 1 will appear in output 1 and some of it will appear in output 2, 
thereby cloning the state. This is, in fact, exactly what happens. If we choose 

|*>23 = Co|3oo)23 + Cl|S 0a; )23, (3) 

as the input state for qubits 2 and 3, with cq and c\ real for simplicity, then the 
reduced density matrices for outputs 1 and 2 are 

p[° Ut) = (c? + c oCl M<V>|+f/ 

P { r t] = {ci+c^m+^i, (4) 

Note that by choosing cq and c\ we can control how much information about 
|^) goes to which output. In particular, if we choose Cq = c\ = l/\/3, then the 
information is divided equally, and we find that 

p ^= P t t] = l\m\ + ^x><M (5) 
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Figure 1: A circuit for an approximate quantum doner with three qubits and 
four Controlled-NOT gates. The qubit to be copied goes into input 1, and the 
copies come out in outputs 1 and 2. The anti-clone comes out of output 3. In 
the symbol for the Controlled-NOT gate, the filled circle indicate the control 
qubit, and the open circles indicate the target qubit. 



where |^>j_) is the qubit state orthogonal to \ip). Therefore, the fidelity of the 



doner output pf wt ' (or p^™* 1 , since they are the same in this case) to the ideal 
output, \tp), which is given by (i>\p { ° ut) \il>), is 5/6. A fidelity of one would imply 
perfect cloning, so what we have here is a device that produces two copies of 
the input qubit that are pretty good approximations to it. 

Note that the doner employs three qubits, and we have only discussed the 
final state of two of them. One might wonder if the output state of the third 
qubit is of interest. The answer "yes." Its state is the best approximation to 
the state orthogonal to that of the input qubit that can be realized. A machine 
that sends a qubit in an arbitrary input state into the state orthogonal to 
it, \ip±), is known as a universal-NOT, or UNOT, gate, and this transformation 
is also impossible to perform exactly [7j [8]. It can, however, be performed 
approximately, and the best fidelity that can be obtained is 2/3. This can 
be achieved by measuring the original qubit along an arbitrary axis and then 
producing an output qubit whose state is orthogonal to the state obtained as 
the result of the measurement. For example, if one measured along the z axis 
and found the result +z, one would create a qubit in the — z direction. The 
same result can be achieved by taking the third qubit, the one which is not a 
clone, from the output of a quantum cloning machine. This output qubit is 
sometimes referred to as an anticlonc. 

There have been a number of realizations of a quantum cloning machine, 
most based on a device known as an optical parametric amplifier [9l [lOj [11] . 
This device takes one photon at frequency 2u> and converts it into two photons 
at frequency u>. A strong beam at 2u will amplify a weaker beam at frequency ui 
via stimulated emission. When this device is used as a doner, the qubits are the 
polarization states of the photons. A photon in an arbitrary polarization state 
at frequency u> will produce three photons, all at frequency to, at the output. 
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Two of them will approximate clones, and the third will be an approximate 
anticlone. 

A second cloning strategy is a probabilistic one |12j . In this case, one wants 
to clone a quantum state that is selected from a known, finite set of states. 
For simplicity, let us assume that this set contains two elements, \tpx) and IV^)- 
Our machine is then to do the following. Given an input qubit that is in either 
the state \tpi) or |?/>2), we don't know which, it is to produce two copies of 
the input state. If the two input states are not orthogonal, this cannot be 
done perfectly. It can, however, be done probabilistically. The machine either 
produces two perfect copies of the input, or it fails, and it tells us which of these 
two possibilities has occurred. The probability of successfully cloning the input 
is given by 

Psucc = i + |(^| y, 2 )r (6) 

Note that this is one if the states are orthogonal, and decreases as their overlap 
increases. 



2.2 State discriminators 

The problem in quantum state discrimination is, given a particle in an unknown 
state selected from a known set of states, determine the quantum state of the 
particle [T3J Q3] . If the set of possible states contains states that are not orthog- 
onal to each other, then this cannot be done perfectly. Again, for simplicity, let 
us assume that our set of possible states contains only two states, \ipi) and \tp2)- 
We shall again explore two strategies for accomplishing this task. The first 
is the minimum-error strategy, which is a strategy that approximately discrim- 
inates the two states |15j . It can make mistakes, but the probability of making 
a mistake is minimized. A machine that implements this strategy is given an 
input, which is equally likely to be or \ip2), and it then tells us which of the 
two states it was given. The probability of the output being incorrect is 

Perr = i(i - yRw). (?) 

Note that when the states are orthogonal, this is zero, but that it increases 
as the overlap between the states increases. The second strategy is known as 
unambiguous state discrimination [16\ 117} 118] , In this case our machine has 
three outputs, one corresponding to state 1, one corresponding to state 2, and 
a third corresponding to failure. This machine will never incorrectly identify 
a state, but it may fail. For example, if the input is in \tpi), the machine will 
either tell us the input was in state \ipi), or fail, but it will never tell us the 
input was in state \ip2)- Assuming that each input state is equally likely, the 
probability of successfully identifying the state is 

P«oc=l-K^i|^2>|. (8) 



5 



As has been the case before, the probability of successful identifying the input 
state is one if the states are orthogonal, and decreases as the overlap of the 
states increases. 



3 Programmable machines 

We now want to consider programmable quantum machines, which we shall 
often refer to as quantum processors [15] . These have two inputs, one for the 
data, which is to be acted upon, and one for the program, which will specify 
the operation to be performed on the data. Both the data and the program 
are quantum states. In particular, the processor is a unitary operator acting 
on the Hilbert space TLd <8 'Hp, where Hd is the data Hilbert space and H p is 
the program Hilbert space. The machine can act in either a deterministic or 
probabilistic fashion. In the case of a deterministic machine, we always accept 
the output, and the action of the machine on the data state is described by a 
trace-preserving completely positive map, which is a result of tracing out the 
program state output. In the case of a probabilistic machine, we measure the 
program state output, and only accept the data state output if a particular 
result is obtained. We shall examine both scenarios. 

It is, perhaps, best to begin with an example [20 . Let us go back and 
consider the three-qubit circuit for the approximate doner. Qubit 1 will now 
be our data state, and qubits 2 and 3 will be our program. We will denote the 
data state by \ip)i and the program state by |S)23- Define the two-qubit Bell 
states to be 

l*±) = ^(|00) ± |11)) 

l*±) = ^(|01)±|10)). (9) 
If these states are used as programs in our processor, we find that 

Ml|*+>23 -> |l/>>l|#+)33 

|V')l|*-)23 -> cr z jV>)l|*->23 

|V>)l|$+)23 -> 0-x|^)l|$+>23 

|VM*->23 - (-«7 w )|V)l|*-)23 J (10) 

where a Xl a y , and a z are the Pauli matrices. If we choose the program 

1 2) 23 = C |*+) 2 3 + Ci|$+) 2 3 

+ C 2 |^->23 + C 3 |*_> 23 , (11) 

then operating our machine in the deterministic mode, by tracing out the pro- 
gram state output, we obtain for the data state output 

(out) I 1 2 (in) I 1 2 (in) 

Pi = M Pi +\Cl\ &xPl Ox 

I |2 (in) I i2 (in) / 1n \ 

\c 2 \ dypi 'vy + \c 3 \ a z p\ 'a z . (12) 
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In the above equation, we have set p " — Examining the output state, 

we see that this circuit can implement a number of quantum channels: the bit- 
flip channel (c 2 = C3 = 0), which flips a bit, with a certain probability, the 
phase-flip channel (cj = c 2 = 0), which sends |0) — > |0) and |1) — > — with a 
certain probability, and the depolarizing channel (c\ = c 2 — C3), in which the 
input state is replaced by the completely mixed state, with a certain probability 
PQ. The same processor can be used in the probabilistic mode. Suppose we 
want to implement the operator A = I — 2|0)(0| on the data state, where \<f>) 
is a specified one-qubit state. The operator A is similar to a z , but instead of 
flipping the phase of the state |1), it flips the phase of the state \(j>). Defining 
the two-qubit operator, Uq, 

[/ |Q0) = -|10) U \1Q) = 
CT |01) - |00) U \ll) = |01), (13) 

we choose for our program state 

IS)*, - ^U {\<t>h\<P±) 3 + \<t>x)Ma), (14) 

where \cj>±) is the qubit state orthogonal to \4>). At the program state output, 
we project onto the state (|<I> + )23 + | ) 23 + |*-)23)/v / 3, and if we get one, 
we keep the data state output. This will happen with a probability of 1/3, 
independent of the state |</>). If we do get one, then the data state output will 
be in the state A\tp). 

Now, let us return to deterministic processors and examine the resources that 
are necessary in order to implement a given set of operations on the data. Sup- 
pose that our data state is a qubit, and we want to implement a one-parameter 
unitary group U(a) = exp(iacr z ), where < a < 2n, on it. We want to en- 
code the angle a in the program state. It turns out that this cannot be done 
with a finite dimensional program space, due to a no-go theorem due to Nielsen 
and Chuang [3T]. It states that if the program |Si) implements the unitary 
operator Ui on the data state, and |S 2 ) implements the unitary operator [/ 2 , 
then (S]JS 2 ) = 0. This implies that for every unitary operator that the pro- 
cessor can implement on the data state, we need an extra dimension in the 
program space. In the case of our one-paramenter group, there are an infinite 
number of operators, so it clearly cannot be implemented on a processor with a 
finite-dimensional program space. 

Given this result, we can adopt the same strategies we did in the case of 
single-purpose machines that were prohibited by a no-go theorem. We have 
already seen in our example, that a probabilistic machine has no problem im- 
plementing an infinite number of operations. The only remaining issue in that 
case is figuring out how to make the success probability as large as possible. 
This we shall address shortly. The other strategy is to construct a machine that 
carries out a set of operations approximately. It is this type of machine we shall 
discuss now [23] . 
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Wc have seen that deterministic processors implement trace-preserving, com- 
pletely positive maps. Therefore, when considering approximate deterministic 
processor, we need to have some kind of a measure of how close two such maps 
are. We shall use the process fidelity [22 , which has a number of useful prop- 
erties [53]. Let T\ and T 2 be two trace-preserving, completely positive maps, 
mapping the space, B(Tt), of linear operators on a D-dimensional Hilbert space, 
TL, into itself. The Jamiolkowski isomorphism associates a density matrix on 
H ® TL to each trace-preserving, completely positive map on B(H). Letting 
{b)b = 1,2, . . . D} be an orthonormal basis for TL, define the maximally entan- 
gled state, |$) in TL <g> TL 

l*> = -7^I>'W>- ( 15 ) 

The density matrix associated with the trace-preserving, completely positive 
map, T, is 

p = (j®r)(|$)($|), (16) 

where T is the identity map. If p\ is the density matrix associated with 7\ and 
p2 is the density matrix associated with T 2 , the the process fidelity between T\ 
and T2 is 

FprodTun) = [Tr(^plp 2 ^pl) 1 / 2 ] 2 . (17) 



We will not discuss the case of a general approximate processor, but will 
look at a specific type. Suppose we have a processor that is a controlled-U 
gate. That means that if our program space, TL P , has dimension TV, there is an 
orthonormal basis of TL P , {\j)\j = 1,2,.. . N} such that the processor acts as 
follows 

\m) - u^m, as) 

where Uj is a unitary operator on the data space. Therefore, this processor 
implements the set of unitary operators S u = {Uj\j = 1,2,... N} perfectly. Now 
suppose we want to use this processor to approximate another unitary operator 
U, which is not in S u . We want to choose a program state that maximizes the 
process fidelity between the map it generates and U. What one finds is that the 
best program is one of the basis states \ j), and it is the one for which |Tr([/^r/j)| 
is a maximum. In this case, one simply chooses the unitary operator in S u that 
is closest to U and implements that operator. Using a program state that is a 
superposition of different basis vectors does not help. 

Let us look at an example of this situation. We wish to implement the 
operator, for < 9 < 2ir, 



U{6) = exp 



^(e- ie a+ + e ie a-) 



(19) 



on our data state, which is a qubit. Here < 9 < 2n, and a ± — (a x ± 
ia y )/2. Our program state has dimension N, and an orthonormal basis = 
1,2... N}. Define the operators E± on the program space by E±\j) — \j ± 1), 
where the addition and subtraction are modulo N. Now let the overall processor 
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unitary operator, which acts on the tensor product of the data and program 
spaces, be 

ITT 

G = cxp — (a + E_ + a~E+) , (20) 



and consider the program states 



N-l 

3=0 



(21) 



When 9 = 9 m = 2nm/N, for m an integer between and N — 1, we find that 



where \tp) is a general qubit state. Therefore, this processor implements the 
operations U(0 m ) perfectly. Now suppose we want to implement U{9) for a 
value of 9 that is not one of the 9 m . The optimal strategy is to find the 9 rn 
closest to 9 and to send in the program state \9 m ) corresponding to that value. 
If we do so we find that 



Rather than determining which 9 m is the best one to use, a simpler procedure 
is just to use the program state \9). There should be some cost to doing this, 
and, indeed, we find that the process fidelity in this case is approximately 1 — 
(2/N). The optimal program has an error that goes like 1/N 2 while the simpler 
procedure gives an error of 1/N. Determining whether the extra accuracy is 
worth the extra work in determining the best program would depend on the 
application. 

Now let us return to probabilistic programmable devices. Suppose our data 
system is a qubit, and we want to implement the one parameter group we men- 
tioned earlier, U(a) = exp(iaa z ), where < a < 2ir. This can be accomplished 
with a success probability of 1/2 by using a qubit program and a controlled- 
NOT gate. As was noted before, the controlled-NOT gate has two inputs, a 
control input and a target input. The state of the control qubit is not changed, 
and if the state of the control qubit is |0), neither is the state of the target 
qubit. However, if the control qubit is in the state |1), then the operator a x 
is applied to the target qubit. It is, in fact, a controlled-U gate with the two 
unitary operators being the identity and a x . In our case, the target qubit is the 
program and the control qubit is the data. The program states are 



G(\iP)\e m )) = U(6 m m\6 m ), 



(22) 



F proc (U(9),U(9 m ))> cos- 



(23) 



|S(«)> = ( e -|0> +e— 



(24) 




(25) 
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By measuring the program state output in the basis {|0), |l)},and keeping the 
result only if we get |0), which happens with a probability of 1/2, we obtain the 
data state output U(a)\ip), which is the desired result. 

A closely related programmable device has been recently realized experimen- 
tally |25j . It carries out the transformation 

a|0)+j8|l>->a|0)+e*/3|l), (26) 

where the angle tfi is encoded in a second qubit. The qubits are polarization 
states of photons, with \H) representing a horizontally polarized photon and 
\V) representing a vertically polarized one. A polarizing beam splitter, which 
transmits horizontally polarized photons and reflects vertically polarized ones 
is the main component of the device. The beam splitter has two input modes, 
which we shall label 1 and 2, and two output modes, which we shall also denote 
as 1 and 2. For a photon incident in input mode 1 we would have \H)± — > \H)i 
and \V)i — ► \V) 2 - Input mode 2 behaves similarly. If two photons, one in the 
state a\H)i + /?|V)i (data) and the other in the state (l/V2)(\H) 2 + e^\V) 2 ) 
(program) are incident on the polarizing beam splitter, then in the cases in which 
a single photon emerges from each output, which happens with a probability of 
1/2, the conditional output state is 

|VW> = -^(a|#)i|if> 2 + e^p\V)AVh)- (27) 

If we measure the second photon in the | ±) = ( 1 / v2) ( | if) ± | V ) ) basis, then the 
remaining photon is in either the state (l/^/2)(\H)+e l( ^\V}), if our measurement 
result was |+), and (l/V2)(\H) —e l ^\V)), if our measurement result was |— ). If 
we obtain the result |— ) we can apply a correcting operation on the remaining 
qubit that sends \H) — > \H) and \V) — > — \V). The final result is that this 
device implements the transformation 

a\H)+f3\V)^a\H) + e l *l3\V), (28) 

with a probability of 1/2. 

Suppose that we want to increase the probability of a successful outcome. 
One possibility is to try again if get the wrong result of our measurement on 
the program state [26, 27J. If we obtained the result |1) from our measurement, 
then the data qubit is in the state U~ 1 (a)\tp). We can take this quibit and run 
it through the processor again, but this time use the program |S(2a)). If we do 
so, the output state is 

|*' out ) = ±(U(aM\0) + U-\3a)\m))- (29) 

We again measure the program state and keep the result if we get |0) . This again 
happens with a probability of 1/2. Adding this second step has increased our 
overall success probability to 3/4, and the procedure can be repeated to bring 
the success probability as close to one as we wish. What we need to do this, 
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however, is a collection of qubits in the proper program states, that is, besides 
a qubit in the state |3(a)), we need an additional one in the state |S(2a)). 

We can also accomplish the same thing by enlarging our program space [27] • 
Our data space still consists of one qubit, but the program space now contains 
two qubits. Let us label the three inputs, input 1 being the data input, input 2 
the first program input and input 3 the second program input. The processor 
now consists of two gates. The first is a controlled-NOT gate whose control 
qubit is qubit 1 and whose target qubit is qubit 2. The second gate is a Toffoli 
gate. This gate has two control qubits and one target qubit. The states of 
the control qubits are not changed, and if they are in the states |0)|0), |0)|1), 
or |1)|0), neither is the state of the target qubit. However, if they are in the 
state |1)|1), then a x is applied to the target qubit. In our processor, qubits 1 
and 2 are the control qubits and qubit 3 is the target qubit. The input state is 
|^)i|S(a))2|S(2a))3, and the output state is 

WLt) = i[C/(a)|^)i(|0) 2 |0)3 + |0) 2 |l)3 + |l) 2 |0) 3 ) + C/- 1 (3a)|^)|l) 2 |l)3]. (30) 

At the output we measure the program qubits in the computational basis and 
keep the data state output if we get |0)|0), |0)|1), or |1)|0). If we do, the data 
output is in the state U(a)\ip), and we have achieved our goal. This happens 
with a probability of 3/4. By increasing the dimension of the program space 
further, we can increase our probability of success. We have, therefore, two 
strategies for increasing the success probability for a probabilistic processor. 

4 Why quantum programs? 

The programs in the quantum processors we have been discussing have been 
quantum states. One might wonder whether this is necessary and whether 
classical programs would suffice. That is, one could have gates that can perform 
a number of operations, but the selection of which operation they do perform 
is governed by a classical input. Do quantum programs provide an advantage? 
There are several scenarios that suggest themselves for which quantum programs 
would be useful. One is that the information on which the program is based is 
intrinsically quantum. We shall explore an example of this situation when we 
discuss programmable state discriminators. This could also occur if the program 
is the result of an earlier quantum computation. A second situation is one in 
which we would like to apply quantum information processing techniques, such 
as a Grover search, to programs. In that case, the programs must be quantum. 

Let us first consider programmable state discriminators. The first such de- 
vice was proposed by Buzek and Dusek [28 . Here we will discuss a different 
version, which is a type of universal state discriminator |29j . So far, when dis- 
cussing state discriminators, we have assumed we knew the set of states we were 
trying to discriminate among. This knowledge was built into the discrimina- 
tor. The resulting discriminator is useful for discriminating states from that 
particular set, but it is not useful for discriminating among members of other 
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sets of states. Suppose, however, that we would like a discriminator that would 
work for any set of states, i.e. a universal discriminator. In that case, we have 
to provide information about the set of possible states as well as the quantum 
system whose state we want to determine with the machine. The information 
about the set of possible states will be the program. 

Let us consider the simplest version of such a device. It will unambiguously 
discriminate between two different qubit states. The program consists of two 
states, one in each of the states we want to discriminate between, which we 
shall call |</'i) and |</' 2 )- The data quint is in cither |?/>i) or |i/' 2 ). and we would 
like to know which. What the machine does is implement a POVM, which 
takes advantage of the symmetry of the three-qubit input state. Let us call 
the program inputs a and 6, and the data input c. Our task is to discriminate 
between the states 

- |^l>a|V>2)b|Vl>c 

|* 2 > = \i>i) a \rh)b\rh)c- (31) 

Note that in the first and third qubits are in the same state, while in 
the second and third qubits are in the same state. Therefore, if we project 
the three-qubit input state onto the antisymmetric subspace of qubits a and c, 
and we get a nonzero result, then we know that qubit c was in the state \ip2}- 
Similarly, if we project qubits b and c onto the antisymmetric subspace of two 
qubits, and we get a nonzero result, then we know that qubit c was in the state 
There will also be a "don't know" result in which the measurement fails, 
and we want to minimize the probability of obtaining this result. If the two 
states are equally likely, and averaging over and \ip2), since we do not know 
what they are, we find that the optimal probability of identifying the input data 
state is 1/6. Note that in this case, the information contained in the program 
was quantum information, in particular, it consisted of examples of quantum 
states, and this necessitated the program itself being quantum. 

Now let us look at an example in which it is useful to apply quantum infor- 
mation processing techniques to quantum programs. In order to do so, we first 
need to explain the quantum search algorithm due to Lov Grover (3D] • We have 
a black box that evaluates a Boolean function. A Boolean function is one whose 
value is either zero or one. We send in an input, which is an n-digit binary 
number, x, and the output of the box is /(x). This particular function is zero 
on all inputs except one, which we shall call xo, and /(xo) = 1. Our object is 
to find xo with a minimum number of uses of the black box. 

Classically, we simply send in different inputs to the black box until we 
find one that gives one as an output. On average we will have to make 2(™~ 1 ) 
tries. The Grover algorithm works in a completely different way, and its result 
is a considerable improvement over the classical one. It starts with an input 
state that is an equal superposition of all possible input values. By successively 
applying the black box followed by an operator Grover called "inversion about 
the mean" approximately 2™/ 2 times, the initial state is rotated into the state 
|x ), and then one simply measures this state in the computational basis to 
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find out what x$ is. Note that the black box was only used 2 n / 2 times in this 
case, which means that the number of evaluations in the Grover algorithm is 
approximately the square root of the number of evaluations that are necessary 
in the classical case. 

Now consider the following problem [51] . We have a set of M permutations 
on N objects. In particular let X = {k\k = 0, 1, ... N — 1} be the set of objects 
being permuted, and let S — {<Jj\j — 1,2,... M} be the set of permutations. 
For some specified ko,ki € X, we are promised that there is one <Tj £ S such 
that er(feo) = hi) and we want to find which permutation satisfies this property. 
A variant of this problem, determining whether there is a aj £ S such that 
c(fco) = ^l, can be used to attack the conjugacy problem in group theory 
If G is a group, and 51,32 G G, we would like to know whether g\ and 32 
are conjugate to each other, that is, whether there is an h € G, such that 
g-2 = hg\h . The connection between this problem and the one involving the 
permutations is provided by realizing that the automorphism : G — > G given 
by cth(g) = hgh~ l is just a permutation on G. Thus, the conjugacy problem is 
reduced to determining whether there is an ah such that ah(gi) = .92- 

We suppose we have a quantum processor, which acts on the Hilbert space 
H-x <£>T~(-s , where Tix is spanned by the orthonormal basis {|fc)x|fc = 0, 1, . . . N — 
1} and Hs is spanned by the orthornormal basis {|j)s|j = 1,2, ...A/}. We 
regard Hs as the program space, and Hx as the data space. The processor acts 
as follows 

\j)s\k)x - \j)sUi\k) x , (32) 

where Uj\k)x — \ G j{k))x- Once we have this processor, we can do a Grover 
search on the programs in order to find the permutation that satisfies a(k ) = k\. 
This will require approximately v M uses of the processor, whereas classically 
M uses would be required. It is the fact that the programs are quantum states 
that allows us to search among them by using a quantum search procedure. 



5 Conclusion 

As we have seen, quantum machines have been developed for a number of in- 
formation processing tasks. Cloners move quantum information around and 
discriminators allow one to distinguish among nonorthogonal quantum states. 
Discriminators can be generalized to distinguish between nonorthogonal sub- 
spaces as well [32 . In addition, we have seen that it is possible to construct 
programmable quantum machines, which are capable of performing a number 
of different tasks. 

The capabilities of programmable machines are still not well understood. We 
concentrated mainly on processors that implement unitary operators, but, as we 
saw processors can also implement more general maps. Some families of maps, 



for example, those in Eq. (12), can be implemented with a finite-dimensional 
program space, while others, such as a one-parameter unitary group, cannot. 
What determines whether a set of maps can be programmed with a finitc- 
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dimensional program? Another issue is the equivalence of programmable pro- 
cessors. Suppose we have two processors, both of which can perform the same 
set of operations but they do so with different programs. This could happen, for 
example, if the one of the processors differed from the other simply by having 
a fixed unitary gate at the input to its program register. Given two processors, 
is there a simple way of telling whether or not the set of operations they can 
implement is the same? These are only two questions about the properties of 
quantum processors, and we suspect there are many more. 
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